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Abstract 

This article will examine states that superpose different amounts of entanglement and pro- 
tocols that run in superposition but generate or consume different amounts of entanglement. 
In both cases we find a uniquely quantum difficulty: entanglement cannot be conditionally 
discarded without either using communication or causing decoherence. 

I will first describe the problem of entanglement spread in states and operations, as well as 
some methods of dealing with it. Then I'll describe three applications to problems that at first 
glance appear to be quite different: first, a reinterprctation of the old observation that creating n 
partially entangled states from singlets requires 0{^Jn) communication, but cannot itself be used 
to communicate; second, a new lower bound technique for communication complexity; third, an 
explanation of how to extend the quantum reverse Shannon theorem from tensor power sources 
to general sources. 

1 Introduction 

This paper will challenge the idea that, when it comes to entanglement, more is always better. 
While some resources in quantum information theory, like use of communication channels, can be 
safely discarded, entanglement cannot be kept in one branch of a superposition and discarded in 
another without causing decoherence. First, I will outline two specific challenges that this fact 
poses to the traditional resource model of quantum information. 

1.1 Motivation: Coherent conditional execution of quantum communication 
protocols. 

A productive way of understanding quantum communication protocols can be to view quantum 
states and operations as resources and protocols that convert one resource into another as resource 
inequalities (RIs)[5,9]. So if [q — > q\ represents a noiseless qubit channel from Alice to Bob, [qq\ is an 
EPR pair and [c — > c] is a noiseless classical bit channel (cbit) from Alice to Bob, then teleportation 
can be expressed as 2 [c — > c] + [qq] > [q ^ q\- This picture was formalized in [9] , which also proved 
many basic intuitive facts about RIs. For example, if a > /3 and /3 > 7 for some resources a, /3, 7 
then Q > 7 as well. 

However, there is another basic way of combining protocols which works trivially in the classical 
case but fails in the quantum case. Suppose Alice and Bob each know a common bit h and want 
to perform protocol Nb conditioned on the value of h. This happens often when communication 
protocols are embedded in larger applications where the amount and type of communication is itself 
input-dependent. Let us call this conditional operation 6?A/i : A/q, following C notation. Classically 
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it is trivial to show that if a > (A/j,) for 6 = 0, 1 then a > (6?A/i : A/q): Alice and Bob use a to 
perform either A/q or Ai depending on their shared value of b. However, in a quantum protocol 
this might leak information about b to the environment, because the environment may be able to 
distinguish A/q and Ai on some inputs. In Section 2.1 I will describe one possible solution to this 
problem, which will be called clean resource inequalities. The main idea will be to discard only (up 
to an asymptotically vanishing error) qubits in standard states, such as |0), so that the environment 
cannot learn which protocol is being run. 

1.2 Motivation: Non-asymptotic analysis of entanglement 

A second problem with the traditional resource framework arises in quantifying pure state entan- 
glement. The entanglement of a pure state j-i/;)"^^ is usually said to be characterized by its entropy 
of entanglement, E{\Ij), which is defined as E{iIj) := S{ip^) = S{ip^). Here ip := |V')(V'I is the 
density matrix corresponding to ■0) '■= tr^ ^p is Alice's reduced density matrix (and similarly for 
ijj^), S{p) := — trplog/O is the von Neumann entropy, and the base of logs and exponentials will 
always be 2. Asymptotically, the entropy of entanglement characterizes the entanglement present 
in a state in the following sense: given lip)^^ with E := i?(|'0)), entanglement concentration]^] can 
produce nE — o{n) maximally entangled states (i.e. \i^'^®nE-o(n) ^ where |$)^-^ := :^(|00) + |11))) 

with o(l) error, while entanglement dilution\2,2Q\ can map |$)®"^+°(") to 1'^)®'' with oil) error and 
o(n) bits of classical communication. Thus, to leading order in the number of copies, tensor powers 
of entangled pure states can be described by a single parameter: the entropy of entanglement. 

In general, however, entangled pure states cannot be fully described by the entropy of entan- 
glement, even given free local operations. For example, instead of von Neumann entropy, we can 
use Renyi entropies. For any a > 0, define Ea{ip) := Saii^^) = ^ostr('0'^)"! and extend 
by continuity to a = 0,1, cxd. In particular, EQiif)) is — logrank^^ and E^{il)) = — log HV'^^Hoo, 
where rank^/^"^ is the number of non-zero eigenvalues of ipA and is its largest eigenvalue. {Sq 

and 5*00 are also called the max-entropy and min-entropy, respectively.) Like E{ip), the Ea{ip) 
are also invariant under local unitaries, and non-increasing on average under local operations and 
classical communicaton (LOCC). Thus they define restrictions on entanglement transforms that 
may be more stringent than those obtained from the non-increase of entropy of entanglement. 
When communication is restricted then Renyi entropies can further limit possible entanglement 
transformations. 

Following [19], define the entanglement spread of a state |^)^^ to be 

A(^) := logrankV-^ + log ||'0^||oo- (1) 

(Also in [19] was the more general A°'^(^) = Ha{tp^) — Hpiil)"^), where Hp{p) := log ti ff and 
a < j3. We recover (1) by setting a = and j3 = oo.) The entanglement spread is never negative 
and A('0) = if and only if has all non-zero Schmidt coefficients equal, meaning it is a product 
state or a maximally entangled state. If A('0) > then we say that is partially entangled (since 
it is neither unentangled nor maximally entangled). Since A ("01 ®tp2) = ^(V'l) + ^{'^2)^ it follows 
that if A(i/;) > 0, then = Q[n). However, this value is not very robust: for any constant 

e > 0, we can perturb |^/;)®" by e and reduce its spread to Q{^/n). To capture this insight we will 
also use the e-perturbed entanglement spread A£(^/^), which is defined for any e > to be (following 

[19]) 

min{logtrP + log||P^^P||oo : trP^^ > 1 - e} , 

where the minimization is over projectors P. Note that Ao(^) = A(^/'). Now Ae(V'®") = Q{y/ri) for 
any e > 0, but of course there are still states on 2n qubits, such as the even superposition between 
1$)®"' and a product state, which have e-perturbed spread nearly equal to n. 
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The main application of entanglement spread is using the following result from [19] to produce 
lower bounds on communication: 

Theorem 1 (Corollary 10 of [19]). // \(t))^^ is transformed using local operations and C hits of 
classical communication (in either direction) into a state that has fidelity 1 — e with [-0) then 

C> A5W-Ao(</.) + 21og(l-5), 

where 5 = (4e)^/'^. 

In particular, if we begin with maximally entangled states, then preparing to within a fidelity 
of 1 — e requires + 21og(l — 6) bits of communication. This restriction holds even with an 

unlimited supply of EPR pairs, and so problematizes the idea that maximally entangled states are 
a good canonical form for the resource of pure state entanglement. In Section 2.2 we will discuss 
alternative ways to quantify entanglement as a resource. 

2 Dealing with entanglement spread 

In this section I propose two solutions to the above problems. To build quantum protocols that can 
be run in superposition, I will demand that they discard only (approximately) standard states to 
the environment, as I will describe in Section 2.1. Then in Section 2.2, I will propose measuring not 
just the maximum amount of entanglement that can be created by a protocol, but instead finding 
the range of entanglement that it can cleanly generate/consume. These approaches are not rigid 
rules, but rather illustrate principles that can be adapted to diverse situations, as we will see in 
Section 3. 

2.1 Approach: clean resource inequalities 

In [9], a resource inequality a > (3 meant that resource a could be approximately transformed using 
local operations into /5. Allowing free local operations is standard practice in quantum information 
theory, but discarding to the environment can be dangerous when running different protocols in 
superposition. Following and extending [17], I will say that a clean resource inequality exists, and is 

clean 

denoted a > P, when a can be mapped to /? using only (up to error that goes to as n — > cx)) 

• Local unitaries. 

• Adding ancillas initialized in the |0) states. 

• Discarding ancillas in the |0) state. 

• Discarding messages that have been sent through classical channels. 

• A dynamic resource {M : uj) (meaning, as defined in [9], an operation M constrained to act 
on average input lo) may be used as a consumed resource only in a way that sends a constant 
state to the environment. More formally, suppose we are given (^j\f^i^2A3^BE . ^Ai^ Then 
we need to replace this resource with (^f\f^i^2A3—>B . such that oj'^'^ = u>"^'^ and J\f{cr)^ 
is the same for all a satisfying a^'^^'^ = Cj^'^^'^ . 

This last point says that noisy resources can be used only when they leak information to the 
environment that is independent of the inputs or the particular protocol being run. For example, 
if we are given [c — > c] as part of the input resource a then we can use it only if we promise to 
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input the same distribution of and 1 regardless of which protocol we're using. On the other hand, 
[q — > q\ can be used with any input since it doesn't leak anything to the environment. 
The primary application of clean protocols is the following general principle. 

Lemma 2 (protocol superposition principle). Suppose that Alice and Boh would like to execute m 
different operations, Vi, . . . , Vm in superposition. This means that they would like to perform an 
operation V that satisfies 

m m 

V^Ck\k)A\k)B\'4')AB ~e„ y^^Ck\k)A\k)B'Pk\lp)AB 
k=l k=l 

for any coefficients {ck}, and where e„ ^ as n ^ oo. 



clean 

Let TZk denote the set of resources capable of simulating Vk cleanly.' TZ^ := : a > (Vk) 



Then 



( clean ^ -.^ 

7^:= |a:a > {r)j=f]nk. 



clean 

One direction of the proof is easy: TZ C n^T^fc, since V > Vk for each k. To prove TZ 5 DklZk, 

clean 

we start with a E DkV.k and clean protocols for a > TZk for each k. Then have Alice and Bob 
run each protocol conditioned on their (shared) value of k. Since each resource inequality is clean, 
nothing is discarded that would break superpositions over different values of k. 

Remark: Note that clean resource inequalities are not the same as reversible RIs (meaning 
asymptotic equivalences; a > /3 and (3 > a). For example, while clean RIs cannot freely discard 
entanglement, they may discard communication resources, or use communication to reduce entan- 
glement, neither of which are reversible. On the other hand, most resource equalities can be made 
clean. The only possible complication arises for protocols that use an unlimited amount of entan- 
glement: i.e. 2[c — > c] + cxD[gg] = [q ^ q] + oo[qq] can be made clean only with some additional 
effort. 



2.2 Approach: entanglement capacity as an interval 

Armed with the definition of clean RIs, we now examine the entangling capacities of various quan- 
tum operations. The most obvious restriction is that entanglement cannot be cleanly discarded, so 

clean 

that e.g. while 2[qq] > [qq], it does not hold that 2[qq] > [qq]. Moreover, eliminating entanglement 
cleanly is now a non-trivial resource. So 

clean 

[c^c:I/2] > -[qq] 

via a protocol where Alice sends her half of a shared state \^) through the classical channel and 
Bob performs a CNOT with the bit he receives as control and with his half of |$) as target. Then 

clean 

he is left with a |0), which he discards. At the same time, [c ^ c : 1/2] > (where is the 
null resource), since Alice can always send a random bit through a channel. This means that 
both protocols can be run in superposition and classical communication can be used to generate 
superpositions of different amounts of entanglement. To express this concisely, we can say that the 
entanglement capacity range of [c — > c] (alternately, its spread capacity) contains [—1, 0]. In fact, the 
spread capacity of [c — > c] is exactly [—1,0] as can be seen from the non-increase of entanglement 
under LOCC for the upper bound and Theorem 1 for the lower bound. 
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We can apply this approach to other resources as well. Instead of measuring the maximum en- 
tanglement that can be sent using a resource, we will find the range of entanglent that it can cleanly 
generate/consume. Table 1 lists the entanglement spread capacities of some common communica- 
tion resources. Most of the bounds are straightforward to prove using Theorem 1 and the LOCC- 



Table 1: Various entanglement spread capacities. 
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[c^c] 


-1 





cobit[15] 




or 


[qq^q] 
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co-cobit[17] 


[q^qq 


or 


[qq^q] 


-1 





ebit 








1 


1 


partially ent. states 








nE - 0{a^) 


nE + 0{a^) 


embezzler [18] 








—ne 


ne 


unitary gate 


(U) 






-E{W) 


E{U) 



monotonicity of entropy of entanglement, but a few lines require explanation. Cobits and co-cobits 
are defined in [15] and [17] respectively, and their capacities can be proven by reversibly mapping 
them to combinations of [q — > q\ and [gg]. For the partially entangled state \iP)ab with reduced 
density matrices and we define E = S{A)^ = — tr if)^ log and cr^ = tr '(/'^(log il)^)"^ — E"^ . 
The n-qubit embezzling state[18] \tpn) is defined to be (up to normalization) Y^i=i "Tjl^)"^!^)^ ^'^'^ 
can be used catalytically (unlike other resources, which are consumed) to create or destroy ne ebits 
(or indeed any state of Schmidt rank < 2"*^) by incurring error e. Finally, for a bipartite unitary 
gate [/, E{U) is its entanglement capacity [3]: E{U) := max{e : (U) > e[qq]} 

By time-sharing, the spread capacity of a resource a is completely characterized by an upper and 
lower bound (although there can always be tradeoffs between entanglement and other resources). 
And by Lemma 2 clean protocols using any amount of entanglement within that range can be 
performed in superposition using a. 

3 Applications 

This section will describe three applications of entanglement spread and clean resource inequalities 
to some apparently unrelated problems. 

3.1 Entanglement dilution 

Let IV')^^ be a partially entangled state. Then, as mentioned above, G(y^) cbits in either direction 
are necessary[14, 19] and sufficient [20] to prepare [■0)®"' from EPR pairs. Unlike many other lower 
bounds on communication, this bound holds for a task which itself has no communication capacity: 
that is, that ability to create from singlets has no value for communication.^ 

The framework of entanglement spread explains this strange situation by the fact that i}{^/n) 
spread needs to be created in order to (approximately) prepare [-0)"^^. Thus the apparent re- 

^Being able to selectively either create , or some other state, such as an all zeroes state, would have com- 

munication capacity if the decision of which state to create were influenced by one party. However, this is different 
from the ability to create \ip)^^ from EPR pairs once both parties have agreed to perform this task. 
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quirement for communication is something of a red herring: for example, using a 0{y/n/ e)-quhii 
embezzhng state, together with nE{'ip) EPR pairs, would be enough to prepare up to error 

e. Alternatively, if we used some quantum operation, such as a bipartite unitary U, to prepare 
from nE{^lJ) EPR pairs, then the number of uses of U needed would be related to the entan- 
glement spread capacity of U (discussed in Section 3.2) rather than by its communication capacity. 
Thus, the communication cost of entanglement dilution can more fruitfully be understood as a 
spread cost. 

3.2 Using entanglement capacity as a lower bound for communication complex- 
ity 

A common method of lower bounding the communication complexity of a distributed function, or 
almost equivalently, the cost to simulate a bipartite unitary gate, has been to use its capacity to 
communicate[8,21]. In fact, entanglement capacity, or even better, entanglement spread capacity, is 
a lower bound that is always at least as strong (since E{U) is at least as large as the communication 
capacity of U [3]). The main idea is the following lower bound for simulating a unitary gate: 

Theorem 3. If U is a bipartite unitary gate such that 

oo[qq] + Qi[q^q]+ Q2[q ^ q] > {U), 

then 2(Qi + Q2) > E{U) + E{U^). 

Proof. The proof is a simple application of Theorem 1. Starting with m EPR pairs, we can use U 
n times to create a superposition of roughly m + nE{U) EPR pairs and m — nE{U^) EPR pairs 
(assuming that m is large enough), for a total spread of roug hly n{E{U) + £"([7^)). On the other 
hand, each transmitted qubit can increase spread by at most 2. □ 

As a corollary, we obtain a new lower bound for the communication complexity of a function 
f{x, y) where Alice holds x and Bob holds y, even with Alice and Bob can use an unlimited number 
of EPR pairs. Following [21] we can turn any quantum protocol for computing /(x, y) into a unitary 
gate that approximates Uf := yi~^)'^^^'^^ \x){x\ \y){y\ ■ If the protocol for / is exact then this 
requires only running the protocol once forward and once backwards (reversing all communication 
and inverting all local unitaries). In the bounded-error case, we could (following [21]) repeat the 
protocol for / O(logn) times to reduce the error to 1/n^, so that (from Lemma 1 of [17]), the 
resulting transform has capacities within 0(l/n) of those of Uf. Thus E(Uf) is a lower bound on 
the communication complexity of computing / exactly and Q{E{Uf)/ log n) is a lower bound on 
its bounded-error complexity. In both cases the lower bounds hold even when the protocols for 
computing / can use an unlimited number of EPR pairs. 

These bounds resemble the results of [21], where the entanglement-assisted communication 
capacity of Uf was used to lower bound the complexity of /; however, they are always at least as 
powerful. This is because of the identity E{U) + E{W) > Cf{U), where Cf{U) = max{Ci + C2 : 
(U) + 00 [gg] > Ci[c— >c] + C2[c<— c]} is the entanglement-assisted communication capacity of U 
[7, 16]. Moreover, there exist[13] unitary gates where E{U) can be exponentially larger than Cf{U) 
and so the spread technique gives correspondingly better bounds, although no distributed functions 
corresponding to such unitaries are known. 

There is one weakness to this lower bound technique, which also appears in the related Renyi- 
entropy-based lower bounds of [22]. While the lower bounds apply even when unlimited EPR pairs 
are allowed, they no longer hold when protocols for / are assisted by embezzling states or other 
entangled states with nonzero spread. However, in future work, I will prove that communication 
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capacities differ by at most a constant factor between the EPR-assisted and the embezzhng-assisted 
cases. 

3.3 The Quantum Reverse Shannon Theorem on general inputs 

If Shannon's noisy channel coding theorem is thought of as using a noisy channel to simulate a 
noiseless channel, then the reverse Shannon theorem uses a noiseless channel to simulate a noisy 
channel. When free shared randomness is allowed, this simulation can be performed using an 
asymptotic rate of communication equal to the capacity of the channel [4, 23]. 

Similarly, the idea of the quantum reverse Shannon theorem (or QRST; originally proposed 
in [4]) is to simulate a quantum channel using shared entanglement and a rate of communication 
equal to its entanglement-assisted capacity. So far this simulation has only been shown possible 
when the inputs are tensor powers[l,6, 10] and in a few other special cases. However, in [6], we will 
show that for general inputs a quantum channel cannot be simulated using unlimited EPR pairs 
and a rate of communication equal to its capacity. In this section, I'll give a sketch of why this is 
true. The main problem is that simulating channels generally requires creating linear amounts of 
spread, for which an an additional resource is necessary, such as extra communication (which could 
be from Bob to Alice), an embezzling state or some other non-standard entangled state. 

We can even use Lemma 2 to obtain the optimal rates for the case of general sources. To see this, 
we first review the rates for the classical reverse Shannon theorem for general sources with feedback 
(meaning that Alice learns Bob's output). This simulation task requires C := maxp/(^;i?)p cbits 
and R := maxp H[B)p—C rbits (bits of shared randomness) [4, 23]. Herep is some input distribution, 
I{A : B)p is the mutual information between random variables A and B where A is distributed 
according to p and B is given by passing A through a noisy channel, and H{B)p is the entropy 
of B in the same setting. Note that the rbit cost may be smaller than maxp H{B) — I{A] B) = 
mayipH[B\A)p since for some input distributions I{A;B)p may not be maximal, and thus some 
of the classical communication can be used to substitute for randomness. Thus, in considering 
the worst input distributions, we should look at their cbit cost and their total cbit-l-rbit cost and 
maximize each separately. If we give up the feedback requirement then there is now a somewhat 
more complicated cbit/rbit tradeoff curve, but the feasible resource region for general sources is 
still simply the intersection of the feasible resource regions of each possible i.i.d. source. 

For the quantum reverse Shannon theorem this is no longer true, due to the cost of creating 
entanglement spread. For simplicity we will discuss the QRST with feedback, meaning that, instead 
of working with a noisy channel M, we use its isometric extension Uj\f and give to Alice the part 
of the output which normally would go to the environment. When we restrict t/v" to a source with 
average density matrix p, then the optimal simulation protocol corresponds to the RI 

I {A; B)p[c ^ c] + S{B)p[qq] > {Um : p). 

Here the quantities I{A] B)p and S{B)p refer to a state in which p^' is purified into a state 1$)"^"^', 
and then A' is sent through Uj\f to obtain outputs for B and E. However, for general sources — 
or even for locally distinguishable mixtures of tensor power states — the communication cost is not 
simply maXp I{A; B)p, even when unlimited EPR pairs are allowed. The problem is that the H{B)p 
entanglement cost varies from source to source, and naively running each fixed-source protocol 
in superposition will consume varying amounts of entanglement, leading to decoherence between 
different branches of the superposition. 

In order to run different fixed-source channel simulations in superposition, we then need to use 
some source of entanglement spread, such as extra communication or an embezzling state, to obtain 
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superpositions of different amounts of entanglement. For concreteness, suppose that we want to 
simulate {Uj^ : p) using Ci[c — > c] + C2[c <— c] + E[qq\. Then this is possible if and only if 

Ci > max I{A; B)„ E > max H{B)a 
p p 

C2>E- min [H{B)^ + min(0, 1{A; B)„ - Ci)] 
p 

Similar tradeoffs can be derived for any combination of input resources. The proof that these 
resources are both necessary and sufficient will follow from Lemma 2; we need only the additional 
fact that any simulation protocol can be converted into a clean simulation protocol, modified only 
by the possible addition of some pure entanglement in the output. 

One additional tool will be needed for completely general inputs, as opposed to mixtures of 
tensor powers: by analogy to the classification of bit strings by their types (i.e. frequencies of 
different symbols), the full QRST will decompose the input in the Schur basis, which splits it 
into different irreducible representations of the unitary and symmetric groups. Full details of the 
construction and its proof of optimality will be in [6]. 

4 Conclusion 

The problem of entanglement spread shows that even bipartite pure state entanglement can retain 
some surprises. Moreover we have seen in the QRST that the idea of spread can yield precise and 
non-trivial statements even when dealing with mixed states and noisy channels. 

One major unresolved question about spread relates to its interconvertibility. While some 
resources, such as classical communication and embezzling states, are more or less universal sources 
of spread, it is still possible that spread exists in many incomparable forms. For example, consider 
the problem of transforming into IV'2)®"' for some integers m, n and partially entangled 

states ,\'il)2) ■ If Alice and Bob can communicate classically (even 0{y/n) bits) then from [2] 
we know that it suffices to use m = nE{ip2)/E{^i) + o{n) copies of \tpi), and it is necessary 
to use at least m = nE{'ip2)/E{ipi) — o{n) copies. But is this amount of communication always 
necessary? However, the current formulations of entanglement spread give only partial answers to 
this question: in some cases ^t{^/n) classical communication is necessary[ll], but it is unknown if 
this holds generically, and how it depends on m. 

A more challenging problem is to determine the extent to which shared EPR pairs or shared 
embezzling states can reduce the quantum communication complexity for a distributed function. 
Many examples are known in which access to EPR pairs can dramatically reduce the classical com- 
munication complexity[12], but much less is known about the question of quantum communication 
complexity. The largest known separation between the quantum communication complexity with 
and without shared EPR pairs is a mere factor of two, in the case of the equality function [24]. On 
the other hand, the only known upper bound on the usefulness of shared entanglement is that the 
classical communication complexity without shared entanglement is at most exponentially greater 
than the (quantum or classical) communication complexity with shared entanglement. 

A related question, originally posed in [22], is to determine whether shared embezzling states 
can ever be more useful in reducing communication complexity than EPR pairs can. In a future 
paper I will prove that communication complexity with shared EPR pairs is at most a constant 
factor higher than with embezzling states (or indeed any shared entangled state). 
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